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Abstract 

We disprove a conjecture in Density Functional Theory, relative to mnltimarginal 
optimal transport maps with Coulomb cost. We also provide examples of maps 
satisfying optimality conditions for special classes of data. 


1 Introduction 

A natural problem in Quantum Physics consists in studying the behavior of N electrons 
subject to the interaction with some nuclei, their mutual interaction and the effect of 
an external potential. In this setting, a relevant quantity is the ground state energy 
of the system, which can be found by solving the Schrodinger equation. However, this 
procedure is computationally very costly even for a small number of electrons; Density 
Functional Theory proposes an alternative method to compute the ground state energy 
and was first introduced by Hohenberg and Kohn [12] and then by Kohn and Sham [13]. 

In [3, 6] the authors present a mathematical model for the strong interaction limit 
of Density Functional Theory; they study the minimal interaction of N electrons and 
the semiclassical limit of DFT. The model is based on Monge multimarginal optimal 
transport (see also the recent survey [9], where the state of the art on this problem is 
described), which consists in the minimization problem 

(M)=inf| f C{x,T 2 {x),...,TN{x))dp{x):T 2 ,...,T^ er{p)], (1.1) 

I JR" J 

where p G ^(M'^) is a given probability measure, C : —)■ [0, oo] is the Coulomb 

interaction 

C{xu...,xn)= X I ^ I V(xi,...,x^) G (M-)"^, (1.2) 

and T{p) is the set of admissible transport maps 

Tip) = {T:W^ Borel : = p} . 
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Since the cost is symmetric, a natural variant of the Monge problem allows only cyclical 
maps 

(Meyci)=inf| f C{x,T{x),...,T^^-^\x))dp{x):Ter{p),T^^^ =ld] 

where with we denote the composition of T with itself for k times. Following the 
standard theory of optimal transport (see [19, 1]), we also introduce the Kantorovich 
problem 

(K)=min< / c(xi,..., xat) (i 7 (xi,..., xat) : 7 G n(p) I , 

[ J(IR")JV J 

where n(/)) is the set of transport plans 

n(p) = { 7 G : 7rj7 = p, i = 1,..., AT } 

and TT* : —?■ M” are the projections on the i-th component for i = 1,..., A^. To 

every (A^ — l)-uple of transport maps T 2 ,... ,T]\f G T{p) we canonically associate the 
transport plan 7 = (Id, T 2 ,..., Tn)^p G n(/ 9 ). As proved in [4], if p is non-atomic the 
values of the minimum problems coincide 

{K) = (M) = (Mcycl). 

Existence of optimal transport plans in {K) follows from a standard compactness 
and lower semicontinuity argument. In turn, existence of optimal maps in (M) is largely 
open; it is understood only with N = 2 marginals in any dimension n and in dimension 
n = 1 with any number N of marginals (see [ 6 ] and [5] respectively). In a different 
context, optimal cyclical maps as in (Mcyd) appear in [ 11 ] for some particular costs 
generated by vector fields. 

As regards uniqueness of optimal symmetric plans with Coulomb cost, it holds in 
dimension 1, but, as shown in [15], it fails in the same class already when we consider 
spherically symmetric densities in M^, for any N. On the other hand, the Kantorovich 
duality works also for this cost (see [16]) and the dual problem admits maximizers 
(namely, Kantorovich’s potentials), as shown by De Pascale [ 8 ]; moreover, in [7] the 
limit of symmetric optimal plans as Ai —)■ 00 is shown to be the infinite product measure 
of p with itself. 

Beyond the 1-dimensional case, which is well understood, a physically relevant case 
is given by spherically symmetric densities p in with any number of marginals. 
In the physics literature, they appear in [17, 18] to study simple atoms like Helium 
{N = 2), Litium (A^ = 3), and Berillium (A^ = 4). In this case the problem reduces, 
thanks to the spherical symmetry, to a problem in 1 -dimension, with a more complicated 
cost function (see [15], where this reduction is rigorously described). In the class of 
admissible transport maps for problem (Mcyd), Seidl, Gori Giorgi and Savin identified 
some particularly simple maps: roughly speaking, they divide M” in N spherical shells, 
each containing one electron in average, and consider the transport maps which send 
each shell onto the next one by a monotonically increasing or decreasing map. They 
conjecture the optimality of one of these maps in (Mcyd). 
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Figure 1: A configuration of three charges at distances ri, r 2 and with angles 02 and 
03- 

In the following, we provide counterexamples to the conjecture showing that there are 
cases in which none of these maps is optimal in problem (Mcyd). On the other hand, we 
also point out situations where some of these maps satisfy optimality conditions, namely 
c-monotonicity. We deal for simplicity with radial measures in with 3 marginals, 
although similar examples and computations can be carried out in any dimension and 
with any number of marginals. 

The plan of the paper is the following. In Section 2 we present the problem with 
spherically symmetric data, we recall the notion of c-monotonicity and a few properties of 
optimal transport maps, and we give some examples and counterexamples. In Sections 3 
and 4 we study the properties of the cost for close radii and for spread apart radii, 
respectively. In Section 5 we apply these properties to give rigorous proofs of the 
examples and counterexamples. 

2 Examples and counterexamples 

2.1 Monge and Kantorovich problems with radial densities 

As we mentioned above, the transport problem (1.1) reduces to a 1-dimensional one (i.e., 
by proving that spheres get mapped to spheres), as rigorously done in [15]. Assuming 
from now on Ai = 3, given three radii ri, r 2 , G M+ = (0, oo), we consider the associated 
exact cost (see Figure 1) 

c(ri,r 2 ,r 3 ) = mini ^- - -r -h ^- - -r -h ^- - -r : \vi\ = n, i = 1,2,3 1 , (2.1) 

L|u2-ui| |u3-i; 2| Ti-usl J 

which is a positive, symmetric, continuous function. Let us denote (0, oo) by M+. Given 
a non-atomic probability measure p G .i^(M+), the set of transport maps reads as 

T{p) = {T : M_|_ —)■ M_|_ Borel : Tjjp = p} , 

and the cyclical Monge problem corresponding to (1.1) can be written as 

(Mcyci) = I ^ c(x, r(x), t(2)(x)) dp{x) : T G T{p), = Id | . (2.2) 
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Figure 2: The four types of maps considered in the conjecture in the case of a uniform 
density on [0,3]. 


We also introduce the set of transport plans 

n(p) = { 7 G : 7rj7 = p, i = 1 , 2 ,3 } , 

where tt* : (M+)^ —)■ M+ are the projections on the i-th component for i = 1,..., 3, and 
the Kantorovich multimarginal problem 

(K)=min< / c(ri, r 2 , rs) d 7 (ri, r 2 , ra) : 7 G n(p) L (2.3) 

i ^(M+)3 J 

2.2 Some special maps 

In the following definition, we introduce some special transport maps, which were 
conjectured in [18] to be good candidates for optimality in problem ( 2 . 2 ). 

Definition 2.1. Let p G ^(R+) be a non-atomic probability measure and let di,d 2 G 
M+ such that p([0,(ii]) = p{[di,d 2 ]) = p([(i 2 ,oo]) = 1/3. The DDI-map T : M+ —)■ M+ 
associated to p is the unique (up to p-negligible sets) map such that Tjjp = p and 

• T maps (0,(ii) onto (^ 1 ,^ 2 ) decreasingly, 

• T maps ((^ 1 ,^ 2 ) onto {d 2 ,oo) decreasingly, 

• T maps {d 2 ,oo) onto (0,(ii) increasingly. 

Similarly, we define, for instance, the DID-map mapping (0,(ii) onto (^ 1 ,^ 2 ) decreas¬ 
ingly, {di,d 2 ) onto {d 2 ,oo) increasingly and {d 2 ,oo) onto ( 0 ,(ii) decreasingly. 

The {D,/}^-class associated to p is composed by the maps with all the possible 
monotonicities, under the condition that T^^'> = Id: therefore we have III, IDD, DID 
and DDI, (see Figure 2). 

In the rest of the paper we answer the following question: 

Question 2.2. Is the DDI-map associated to p optimal in problem (2.2) for every 
measure p G <^(M+)? Is one of the maps in {D,/}^-class associated to p optimal in 
problem (2.2) for every non-atomic probability measure p G 
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2.3 A necessary condition for optimality: c-monotonicity 

Before presenting the examples and connterexamples, we recall a well-known optimality 
condition in optimal transport. 

Definition 2.3. Let c : (M+)^ —)■ [ 0 ,oo] be a cost function. We say that a set 
r C is c-monotone with respect to p C { 1 ,..., A} if 


c{x) + c{y)<c{X{x,y,p)) + c{Y{x,y,p)) 'ix,yeT, ( 2 . 4 ) 

where X{x,y,p),Y{x,y,p) G are obtained from x and y by exchanging their 

coordinates on the complement of p, namely 


Xi{x,y,p) = 


Xi if i G p 
Vi \ii^p 


Yi{x,y,p) = 


Vi if i G p 
Xi \i i ^ p 


Vi G A}. 


We say that L C is c-monotone if (2.4) holds true for every p C {1, ..., A}. 


(2.5) 


Let 7 G n(p) be a transport plan. The following Proposition ([14, Lemma 2], see also 
[ 5 , Proposition 2.2], where the result is used to describe optimal maps with Coulomb 
cost in 1 dimension) presents a necessary condition for optimality of 7. 

Proposition 2.4. Let c : (M+)^ —)■ [0, 00] he a continuous cost and let p be a probability 
measure on (M+). Let 7 G n(p) he an optimal transport plan for problem (2.3) and 
assume (K) < 00 (therefore 7 has finite cost). Then supp7 is c-monotone. 


Remark 2.5. Given an optimal plan 7, the snpport of 7 is c-monotone even in a stronger 
sense than the one in Definition 2.3. More precisely, given two points x and y (for 
simplicity, assnme that all their coordinates are distinct to avoid mnltiplicity issnes), we 
have that 

c{x) + c{y) < c{X) + c{Y) (2.6) 

for every choice of X,Y G (M+)^ such that the union of the coordinates of X and Y is 
the same as the nnion of the coordinates of x and y. Indeed, given any permntation a of 
the coordinates of (M+)'^, we have that CT{y) is in the snpport of the symmetrization of 
7, which is still optimal because of the symmetry of the optimal plan. Hence, applying 
Proposition 2.4 to x and o'{y), we obtain (2.6) for any X and Y. 


2.4 Counterexamples 

The first example shows that the DDI-map is not always optimal in problem ( 2 . 2 ), by 
taking as marginal a measnre which is concentrated in a small neighborhood of the nnit 
sphere. 

Counterexample 2.6. There exists e > 0 such that, setting 

Pe = dr G ./#(M+), 

the DDL-map associated to pe is not c-monotone and, therefore, not optimal in prob¬ 
lem (2.2). 
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(a) A configuration of three charges at the (^) ^ configuration of three charges at dis- 
same distance ri from the origin with angles tances ri, r2 and with angles 6*2 = tt and 
O2 = 2/37r and 0^ = —2/37r. ^3 = 0 - 

The proof is based on the analysis of c-monotonicity for similar radii, obtained by 
Taylor expanding the cost around the point (1,1,1). The analysis of c-monotone sets in 
this context suggests that the DID-map may be optimal in this example. 

The next example modifies the previous one by sending 1/6 of the total mass far 
away; in this way, the cost of the orbits of these points (which have two coordinates 
close to 1 and one large coordinate) can be easily computed. Thanks to this property, 
we can show that none of the maps in the {H,/}^-class can be optimal, since their 
support is not c-monotone. 

Counterexample 2.7. There exist M,e > 0 sueh that, setting 



none of the maps in the {D, I}^-elass assoeiated to pM,e is optimal in problem (2.2). 

Remark 2.8. In Remark 5.1 we will see a similar result for the problem with 4 marginals. 
However, we preferred to restrict the presentation to the case with 3 marginals since 
the ideas involved are the same, but the computations are easier. 

There are particular measures p for which the HHJ-map is c-monotone (whereas 
this property fails in Counterexample 2.6 and 2.7). For this reason one may expect that 
this map is also optimal in problem (2.2), but, to show this, sufficient conditions for 
optimality (stronger than c-monotonicity) would have to be identified. 

Proposition 2.9 (Examples of c-monotone HHJ-maps). There exists M > 0 such 
that for any probability measure p sueh that p{\l,2\) = /)([3,4]) = p{\M,co)) = 1/3 the 
DDI-map is c-monotone (according to Definition 2.3). 

3 Taylor expansion of the cost at ri = r 2 = T 3 = 1 

In this section we want to address the following problem: given three radii ri(t), r 2 {t) 
and r 3 (t) parametrized by t G M and starting from the value 1 at t = 0, what is the 


6 



expansion of c(ri{t),r 2 {t),r 2 {t)) in powers of f at f = 0 ? 

First, we notice that at t = 0 the optimal angles are ± 2 / 37 r and c(l, 1,1) = \/3. 
Indeed, given three unitary vectors vi,V 2 ,V 3 , calling ai, 02 , 0:3 the angles of the triangle 
with vertices vi,V 2 ,V 3 we have that |ui — U 2 I = 2 sin 03 (and cyclical) and therefore, by 
Jensen’s inequality and by the convexity of a 1 —)■ (sina)“^ in [ 0 , 7 r], 


+ 


+ 


■^2-^11 1^3 -^^ 2 ! I'Vl-'VS 


1 1 

= -y- 

9 ^ Cl' 


2 ^ sin Oi 


> 


2 sin((ai + 02 + 03 ) /3) 


= \/3, (3.1) 


with equality if and only if the triangle is equilateral. 

Taking the angles to be exactly ±2/3tt leads to the following cost 

. N 1 1 1 / N 

CA(n,r2,r3) := > c(ri,r2,r3). 

\Jr‘l + rir 2 + y ^^2 + y ?’i + + ’’3 

(3.2) 

However the inequality is strict as soon as the three radii are different and the approxi¬ 
mation of c with ca is too rough to deduce that they enjoy the same c-monotonicity 
structures. Therefore, we perform a finer analysis. 

We want to take into account only the first order variation of the radii as functions 
of t, so it is natural to consider three linearly varying radii 


ri(t) = 1-I-ait, r 2 {t) = l + a 2 t, r 3 {t) = I + a 3 t 
where 01 , 02,03 G M are some constants. To these radii we associate the exact cost 


5 ( 01 , 02 , 03 , t) = c(l -I- oit, 1 -I- a 2 t, 1 -I- azt), (3.3) 

and we study the expansion of this function near t = 0 . 

Lemma 3.1. Let 01 , 02,03 G M and let g be as in (3.3). Then we have that 


g{a, b, c, 0 ) = Vs. 

dg, ai + a 2 + 03 

^( 0 ,. 02 . 03 . 0 )---, 

d‘^g 4(af-I -02 + Og)-I- 6(0102 + 0203 + 0301 ) 

^(01,03, 03,0) = - ^ -, 

d'^g 308(af-h a|-h a|) 

gp(Ol, 03.03.0)=-- 


+ 


888(0^02 + 0102 + 0203 + 0203 -|- OgOi -|- Ogof) -|- 498010203 


375v/3 


(3.4) 


In the proof, we will write the Coulomb potential of three charges in terms of the 
distances from the origin and the angles between the charges. Given three radii ri, r 2 , 
rg and two angles 62 and 63 , we define the Coulomb potential of the configuration of 
charges depicted in Figure 1: 


C'(ri,r2,r3,6»2,6'3) 


1 1 
V2-Vl\ \V3-V2 


+ 


1 

|ui - V3 


(3.5) 
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where 

r;i = (ri, 0 ), r ;2 = r 2 (cos 02 , sin 02 ), ^3 = r 3 (cos 03 , sin 03 ). 

By definition of c, we notice that 


c(ri,r 2 ,r 3 )= min C(ri, r 2 , rs, 02 , 03 ). 


(3.6) 


Proof of Lemma 3.1. For t G M and 0 = ( 02 , 63 ) G we define also the function 
G{t, 0) = C{1 + ait, 1 + 02 ^, 1 + o-st, 02, 03 ). 

Then g{t) = G(t, 6 o{t)) where Oo{t) is the pair of angles which minimizes (3.6). From 
this optimality condition we know that 

Ge(t,6o{t)) = 0. 

We want to apply the implicit function theorem to find the behavior of 0o{t). It’s easy 
to check that 0o(O) = (2/37r, —2/37r) and a direct computation shows that 

G00(O,0o(O)) = ^ G Inv(M2;M2). 


Therefore 0o G G°°[{—e,e)) for some £ > 0 and we can compute its derivatives in 0. In 
particular, we have that 


^o(O) 


Gfe' ■ Gte 


(O,0o(O)) 


1 

5v/3 


/-Oi - 02 + 203 

y Oi - 2 o 2 + 03 


The idea is now to consider the first order approximation 


0(t) = 0o(O) + 0[,(O)t 


2/37r \ 1 /-oi - 02 + 2o3\ 

-2/37ry 5\/3 \ ai - 2 o2 + 03 y 


and the perturbed cost 

h{t) = G{t,e{t)). 

We claim that h{t) = g{t) + o{t^), namely 


(3.7) 


h(0) = 5(0), hfO) = g\0), h"(0) = 5 "( 0 ), h"\0) = /'(O). 


The first two are clearly true, since 0(0) = 0o(O) and 0^0) = 0o(O) by definition. Now 
consider the function 1 1 —)■ G(t, 0(t)), where 0 is either 0o or 0. To prove the claim, we 
show that its second and third derivatives at t = 0 depend only on 0 ^ 0 ) and not on the 
second and third derivatives of 0 . 

As a matter of fact, we have 


d^G{t,e{f)) 


i=0 


Gtt + 2Gtee' + Geee'e' + G,0" 
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but Gg(0,0(0)) = 0, so the second derivative does not depend on 0"(O). In a similar 
fashion, we have 

= Gitt+3Gtt6i0^+3Gi0£)(0^)^ + G0£)e(0^)^+3 {Gte + GegO') 6"+Gg9"' ^ 

t=o ~ 

Again, G0(O,0(O)) = 0, therefore 0'"(O) doesn’t contribute. Furthermore, we have 
Gg(t,6o{t)) = 0, so that differentiating in t yields 

^*0(0,00(0)) +G00(O,0o(O))0()(O) = 0. 


d^G{t,e{t)) 

dt^ 


But then also 

Gie (0,0(0))+Gee (0,0(0)) 0'(O) = O, 

since 0^(0) = 0o(O). Therefore we see that in both cases the coefficient of 9" vanishes. 
This concludes the proof of the claim because we have shown that the first three 
derivatives of h and g coincide at t = 0 . 

At this point the derivatives of h can be computed directly, since /i(ai, 02 , 03 , •) is 
an explicit function of the last variable. □ 

In Lemma 3.1 we found the first nontrivial Taylor term in the expansion of g{t). We 
employ this computation to obtain informations on the c-monotonicity of points with 
linearly spaced radii close to t = 0 . 

Lemma 3.2. For every t > 0, eonsider six linearly spaeed radii 

(r’i,r 2 ,r 3 ,r 4 ,r 5 ,r 6 ) = (1,1 + t, 1 + 2t, 1 + 3t, 1 + 4t, 1 + 5t). (3.8) 

Then there exists to > 0 sueh that, for every t < to, 

c{ri,ri,re) + c{r2,ro,r^) < c(ri,r4,r5) + c(r2,r3,r6). 

Proof. Let us define 

F{t) = g{0, 3,5, t) + g{l, 2,4, t) - g{0, 3,4, t) - g{l, 2, 5, t) 

Applying Lemma 3.1 we can compute the derivatives of F and find that 

F(0) = 0, F'(0)=0, F"(0) = 0, F'"(0) =-?^ < 0; 

this shows that F(t) < 0 for t sufficiently small and proves the lemma. □ 

Remark 3.3. Considering ri, ..■,ro as in (3.8), one could prove that the choice 146-235 
is optimal between all possible choices, namely 


c(ri,r4,r6) + c(r2,r3,r5) 

= min{ c(pi,P 2,P3) + c(p4,P5,P6) : {Pi, • • • = {n,-- • + 0 } } , (3.9) 

for t small enough. Moreover, one could see that (3.9) holds also if we replace c with 
CA defined in (3.2). This is, however, not needed for our counterexamples. 
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Remark 3.4 (Asymptotic expansion of the cost at infinity). Although they will not be 
used in the proofs of the main results, we report the following formulas since they might 
help in future studies to gain more insight into the structure of c-monotone sets. We 
are interested in the asymptotic expansion of the cost as some of the radii go to infinity 
and the others remain fixed. 

For (ri,r 2 ,r 3 ) = (1, l,r), the optimal angles are 

02(r) = 7r-^ + o(^^), 03 (r) = -^-l + o(^l^. 


In comparison to (3.7), this expansion is harder to justify (but can be easily verified 
numerically). However, from this fact it follows rigorously that the cost has the following 
asymptotic behaviour: 

c(l, 1, r) = (7(1,1, r, TT, - 7 r/ 2 ) - ^ + o 


= x + 


2 ^/IT 


4 

-j + o 


Similarly, for (ri,r 2 ,r 3 ) = (l,r, r), the optimal angles are 


vr 4 


2 r 


1 


TT 4 


6'2(r) = - + - + oH^ , Gz{r) = ---- + o[^ 


2 r 


.2 / ’ 


and the cost is 


c(l, r, r) = <7(1, r, r, 7 r/ 2 , -tt/2) - ^ + o 


1 


4 

- n O 


2 r 

Furthermore, one can verify that 

c(l,r,r) = (7 ('l,r,r, ^ -C> ) • 

V 2 r 2 r > < 


4 Condition for c = and CTr-monotonicity 


When the radii are spread apart, a reasonable approximate cost appears to be 


Cn{ri,r2,r3) 


1 1 1 
ri + r2 r2 + r3 - ri' 


which arises from collocating the charges at angles 62 = and ^3 = 0 (see Figure 3b). 
In the first part of this section we want to study under which condition on the radii ri, 
r 2 and rz we have 

c{ri,r2,r3) = c^(ri, r2, rs). 

We start with a heuristic argument involving a necessary condition. Up to permutations, 
we may assume ri < r 2 < r^. It is simple to check that 


C'0(ri,r2,r3,7r,O) = 0, 
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Figure 3: The region in the (r 2 ,r 3 ) plane where r 2 , r 3 , tt, 0) > 0, with ri = 1. 

The dotted line is r 3 = r 2 + 7. 


where C has been dehned in (3.5), either by direct computation or by a symmetry 
argument.^ If (02,^3) = (^jO) must be a minimum, then a necessary condition is 

Cdd(ri,r2,r3,7r,0) > 0 , 


in the sense that the Hessian matrix is positive-definite. We have 

^r 2 ( 


C'00(n,r2,r3,7r,O) = 


r-i r-3 

. (^i+’’2)® ' (r2+r3)^ 

r2r3 

(IT+Tip' 


rs 


r 2 r 3 

(r2+r3P 
r2 _ ri 

(Ti+TsT (rs-n)^ J 1 j 


since the hrst entry is positive, this 2x2 matrix is positive-dehnite if and only if the 
determinant is positive too, namely 


detC'0e(ri,r2,r3,7r,O) = 


^H2r3[r2r3(r2 - rs) ri(r| 5r2r3 r|) rf] ^ ^ 

(ri + r2)^(r2 + rsPirs - ri)^ ~ 


or equivalently 

ri(r 2 -h 5r2r3 -h r|) -h r? < r 2 r 3 (r 3 - r 2 ). 

Figure 3 depicts the region where the Hessian is positive. 

We partially justify the previous argument in the following lemma which, despite 
not being quantitative, will suffice for our purposes. 

Lemma 4.1. If 0 < < rf < r^, then there exists rf {rf ,rf ,r 2 ) such that 

for every ri G ^2 £ ^3 ^ we have 


c(ri,r2,r3) = c^(ri, r2, r3). 

^In fact, the four configurations with 62,03 £ { 0 , tt} are always stationary. 
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Figure 4: The curves in whose four intersections correspond to stationary points of 
C{ri,r2,r3,92,93). The two solid curves are defined by (4.1a). The dashed curves are 
defined by (4.1b). 


Proof. We denote by the 2-dimensional torus M^/(27rZ)^. The idea of the proof is 
the following: we claim that for sufficiently large there are exactly four stationary 
points (02,6*3) £ for (^(ri, r 2 , ra, 02, ^ 3 ), corresponding to 02,03 G {0,7r}. Therefore 
c(ri, r 2 , rs) must coincide with the value achieved at one of them and by comparing the 
four values we arrive at the desired conclusion. 

First of all, we compute the gradient 


C'0(n,r2,r3,02,03) = 


_ rir 2 sin(g 2 ) _ 

(^rf+r‘^- 2 rir 2 003 ( 62 ))^^^ 

_ rira sin(6>3) _ 

(r 2 -|-r|- 2 rir 3 003 ( 63 ))^^^ 


_ r2r3 sin(6>2-6>3) 

(r’2+f|-2r2r-3 003 ( 62 - 03 )) 

_ r2r3 sin(62-63) 

(r 2 -|-r|- 2 r 2 r -3 003(62-03)) 



The gradient vanishes if and only if the following equations are simultaneously satisfied: 


rir 2 sin( 02 ) 


(rf -I- — 2 rir 2 cos( 02 ))^^^ 

rira sin(03) 


+ 


nrs sin(03) 


(rf + Tg — 2 rir 3 cos( 03 ))^^^ 


+ 


' 2 rir 3 cos( 03 ))^'^^ 

r 2 r 3 sin (02 - 03 ) 


= 0 , 


(ri 


rg - 2r2r3Cos(02 - 03))^^^ 


= 0 . 


(4.1a) 

(4.1b) 


To show that there are exactly four stationary points, the idea is that, for rg sufficiently 
large, equations (4.1a) and (4.1b) define two pairs of closed curves on T^, of type 
(0,1) and (1,1) respectively, with the property that every curve from the first family 
intersects each curve of the second family in a single point. The situation is represented 
in Figure 4. 

Step 1. Given ri, r 2 and a sufficiently large rg, we claim that for every 0g G 
there are exactly two values 02 ( 03 ), 02 ( 6 * 3 ) £ *5^ which satisfy (4.1a); moreover 0 ^( 03 ) 
and 02 ( 03 ) are close to 0 and tt respectively by less than 0 (rf^'), uniformly in 03 , and 
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their derivatives go to to zero uniformly in 63 for rs —)■ 00 .^ These functions correspond 
to the solid, almost vertical, lines in Figure 4. 

We begin by finding a useful bound on |sin(02)|- The two terms of (4.1a) can be 
estimated by 


rir 2 sin( 02 ) 

(rf + rl — 2 rir 2 cos( 02 ))^'^^ 
rirs sin( 03 ) 


> 


(rf + - 2 rir 3 cos( 03 )) 


3/2 


< 


r2 |sin(02)| 

(rt + rt)^ ’ 

r+r^ 


{rs 


therefore, in order to have equality (4.1a), it must be that 


r^r2 |sin(g2)| ^ rfrs 
{rf + r^)^ “ {rs - r^)3 ’ 


that is 


|sin(6'2)| < 


r^{rf + r+f 
r^r^ 


rs 


{rs - rf)^ 


= 0 {rs^) 


(4.2) 


as r 3 —7- 00 , where the implied constant depends only on rf and r^. 

We have already discussed that, for every 0s G S^, the second term in (4.1a) is 
smaller than r 3 (r 3 — rf)~^ in magnitude. On the other hand, the first term vanishes 
for 02 = 0,7r and is equal to ±rir 2 (rf + for 62 = ±'7r/2. Therefore, by continuity, 

for rs large we have at least two solutions to (4.1a). 

The estimate on |sin(02)| proves that the solutions must be located near 0 and tt. 
Now we want to prove that there are exactly two of them. To do so, we verify that the 
partial derivative with respect to 02 of the first term in (4.1a) is different from zero for 
02 in the prescribed intervals around 0 and vr. Indeed, the derivative is 


d 

1 

' rir2sin(02) \ 

rir2 

802 

6»2=0 

V(rf + rl - 2rir2cos(02))^^^/ 

1 

to 

1 

CO 

8 

1 

^ rir2sin(02) \ 

_ r2 

802 

92 =tt 

V {rl + ^2 - 2rir2cos(02))^^^/ 

(ri + r2)3 


therefore it is different from zero around the two points and the two solutions are simple. 

The claim is almost entirely proved. We now have the two functions 02(' )> ^2 (') 
and the last thing that we want to derive is the estimate of their first derivatives. Let 
02( •) be one of the two functions. Thanks to the implicit function theorem, we know 
that 02 (•) is at least and we can compute 


Q' (Q ') = _!!. 2 (rf + r|) cos(03) + rir 3[-5 + cos(203)] 

^ ^ r 2 2 (rf+ r|)cos(02(03))+rir2[-5 + cos(202(03))] 

/rf + r| - 2rir2 cos(02(03)) \ 
y rf + rj- 2rirs cos{0s) j 

^More precisely, they are close to zero by less than 0 (r^^), uniformly in @3. 
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All the terms are fairly easy to deal with, apart from the denominator of the second 
fraction. However, we have that 


2(rf + r|) cos( 6'2) + rir2[-5 + cos(26'2)] = 2(rf - 2rir2 + rl) > 2(r2 - 

-2(rf + r|)cos(02) - r’ir2[-5 + cos(26'2)] = 2(rf + 2rir2 + r^) > 2(r^ +r^)^. 


therefore, by the continuity of the functions involved and by compactness, there exists 
a neighbourhood U of {0,7r} such that if ri G ^ 02 ^ U then 


2(rf + r|) cos( 6*2) + rir2[-5 + cos(26'2)] 


> (^2 -’’i ) 


+ \2 


From this and (4.2), which ensures that 02 (^ 3 ) £ U, we deduce that for rs large 


02{03)\ < ^ 

^2 


2{rff + 2rl 

(^2- -r^)2 


jrf + ^2^)^ 

(rs - rf)^ 



Step 2. Next we perform the same analysis for (4.1b). We prove that there exist 
two functions @ 2 (^ 3 ) ^ 2 (^ 3 ) which are the only solutions of (4.1b) when 63 is 

prescribed and that their derivatives are strictly positive. First of all, we introduce the 
new variable ^ = 62 — 03 - Equation (4.1b) reads as 


ri sin(6'3) 


(rf + rg — 2rir3 cos(03))^'^^ (r^ + rf — 2r2r3 cos('0)) 


+ 


r2 sin(^/)) 


3/2 


= 0 . 


(4.3) 


• The solutions lie in two strips. From equation (4.3) we get 


{rs - rj)^ 


> 


ri sin(6'3) 


(rf + rg - 2rir3 cos(03))^^^ 
r2 sin(^/)) 


(rf + r| - 2r2rs cos('0)) 


3/2 


> 


r 2 |sin(V>)| 

(rf" + rs)^ 


Therefore we have 


|sin('0)| < 


rs + rf \ rf 
rs -rf j 


which, for rs sufficiently large, implies |sin(f))| < r/ for a fixed tj G (rf/r 2 ,1). 
There are at least two solutions. The first term of (4.3) is bounded by 


ri sin( 03 ) 


(1 + r| - 2r3Cos(6'3)) 


3/2 


< 


ri 


(rs - 1)^’ 


On the other hand, when if: = ±7r/2 the second term equals 


±_^2_ 

H + r|)3/2' 

which is bigger for rg large enough. This tells us that for every 0g there are at least 
two distinct values of f; which solve (4.3), because the second term is a continuous 
periodic function of tp. 
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• There are exactly two solutions. The derivative of the second term is 

d I r 2 sin('0) 

I (r| + — 2 r 2 r 3 cos(V'))^/^ 

We observe that the denominator is always positive. We study the sign of the 
numerator. The equation 

-3r|r3 + (r2 + r2r|)t + = 0 

for the unknown t has the two solutions 

-r 2 -ri + ^4 + 14r2r| + r| -r| - r| - ^r^ + Ur^rl + r^ 

2r2r3 ’ 2r2r3 

However, only the first one lies in the range [—1,1], whereas the second is less 
than —2. In fact, 

r| + r| + + 14r|r| + r| > + r| + + 2r|r| + r| = 2(r| + r|) > 4r2r3. 

Therefore the function has exactly two stationary points and is monotone between 
them. 


—Sr^rs + (rf + r 2 r‘^) cos(' 0 ) + r 2 r^ cos{'ip)^ 
(r| + r| — 2r2r3 cos(^/;))^/^ 


• Derivative of the solutions. At this point we know that there exist two 
functions V’o(^ 3 ) and such that the corresponding 02 (^ 3 ) = + ^3 and 

^ 2 (^ 3 ) — P’-n-iGs) + O 3 parametrize the solutions of (4.1b). 

The goal is to show that for r 3 sufficiently large we have ^ 2 (^ 3 ) ^ C* > 0 for some 
constant C independent of r 3 , where 6*2 (•) is either 0° (') ^2 (') • Thanks to 

the implicit function theorem we can compute the derivative 


^ 2 (^ 3 ) = 


(rl + ^3 — 2r2r3 cos('0))^/^ 


—Sr^rs + (rf + r 2 r 3 ) cos(V') + cos(V’)^ 

( ri cos{6z) r 2 cos(V') 


(rf + r-g — 2 rir 3 cos{9s))^/‘^ (r^ + ^’3 — 2 r 2 r 3 cos(V'))^/^ 

Srfrs 3r2r3 

(rf + rg — 2 rir 3 cos( 03))®/2 (r^ + — 2 r 2 r 3 cos(V’))®/^ 


where 'ip = 62 — 63 a,s before. We introduce the parameter k = l/r^ and write the 
derivative in terms of it. We have that 


^ 2 (^ 3 ) = f{ri,r 2 ,l/r 3 ,e 2 - 03,63) 

where 


f{ri,r 2 ,K,Tp, 63 ) 


(1 — 2r2Kcos('0) + 

— 3 r 2 K + + r 2 ) cos{ip) + r 2 Kcos(V’)^ 


_ ri cos( 6 > 3 ) _ 

(1 + _ 2riKcos(03))3/2 

3rf/tsin(6<3)^ 

(1 + _ 2 ri/tcos( 03))®/2 


r 2 cos(' 0 ) 

(1 + — 2r2KCOs(V'))^/^ 

3r2nsm{ip)‘^ 

(1 + — 2r2Kcos(V'))^/^ 


(4.4) 
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Observe that the only singularities are due to the denominator of the first fraction. 
However, the singular values of V’ he outside the two intervals 

5 = [— arcsin(?7),arcsin(r/)] U [vr — arcsin(?7), vr + arcsin(77)] 

for K sufficiently small (ra large enough), because they converge to ±7r/2. Therefore 
there exists k"*" > 0 such that the function / is continuous in the domain 

D = [r];,rf]ri X [r^,r^]r2 x [ 0 ,/i+]K x x [O,27r]03. 

• Limit case. We rewrite equation ( 4 . 3 ) in terms of k as 

_ n sinjes) _ _ r2 sin(V;) _ ^ 

(1 + — 2riKcos(03))3/2 (1 + — 2r2Kcos(V’))^/^ 

Let rri,r2,K denote the set of solutions (■0,6*3) G x [O,27r]03 to ( 4 . 5 ). By the 
continuity of ( 4 . 5 ) we know that 

r= U U U ^rur2 ,nCD 

rie[rj",r+] r2e[r^,r0] Ke[0,K+] 

is a closed set. Our ultimate goal is to show that / is positive on when k 

is small enough. 

We start by studying the limit case k = 0 . The limit curve ^rx,r2fl is given by the 
equation 

risin(6*3) = r 2 sin( 0 ). ( 4 - 6 ) 

For K = 0 , the function / equals 

/(ri,r2,O,0,6»3) = -— (ricos(6»3) +r2cos(0)) = 1 + 

r2 cos('0j r2 cos('0j 

We claim that this function is positive on the curve defined by ( 4 . 6 ). Indeed, 
positivity is guaranteed if we are able to prove that 

n cos(6*3) ^ ^ 
r2 cos(0) 

But, by squaring, this is equivalent to 

rf cos( 6*3)^ < r2cos(0)^, 

which, thanks to ( 4 . 6 ), reduces to the true inequality rf < r^. 

• Conclusion. Finally, we prove that / > C > 0 on for k close to zero, 

where C is a constant depending only on rf^ and r^. 

We know that / is positive on the compact set 

K= \J U F,3,,3,o. 

ne[rf ,r+] r2&[r-,r+] 

Therefore there exists a positive constant C and an open neighbourhood U of K 
in D such that f > C on U. Since F is closed, a compactness argument shows 
that Tr^^r2,K C U for n close to zero and this concludes the proof. 
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Step 3. The previous steps tell us that (4.1a) defines two vertical curves and (4.1b) 
two diagonal curves. The estimates on the derivatives of such curves prove that the 
intersections are simple, therefore there are exactly four stationary points. But we 
already know four stationary points, namely 

(^* 2 , 03 ) = (0,0), ( 0 , 7 r), ( 7 r, 0 ), ( 7 r, 7 r). 

To conclude, we can just compare the costs associated to each of them and pick the 
smallest one. It is easy to see that ( 6 * 2 , 6 * 3 ) = (vr,0) is the optimal choice. In fact, (0,0) 
is clearly the worst. Among the three cases left, we can say that (vr, 0) always beats 
(tt, tt), that is 


C'(n,r 2 ,r 3 , 7 r, 7 r) - (^(ri, r 2 , rs, vr, 0 ) 


1 

n - r -2 


rs - ri 


+ 


1 

r2 + ri 


1 

r-s + r2 


> 0 , 


as both the differences in parenthesis are positive. Finally, (tt, 0) beats (0, vr) too because 


C'(n,r 2 ,r 3 , 0 , 7 r) - (^(ri, r 2 , rs, vr, 0 ) 


2 ri(r| - rl) 

{rl - rl){rl - rl) 


□ 


In the following lemma, we prove that, with the frozen cost Ctt, given six increasing 
radii numbered 1,...,6 the choice of two disjoint subsets of three elements which 
minimizes the cost is always given by 145 and 236. Actually, we prove only some 
comparisons that are enough for our examples, but one could show in general that 


c,r(?'i, r4, rs) + c^(r 2 , rs, re) = 

= min{ c^(pi,P 2 ,P 3 ) + c,r(p 4 ,P 5 ,T 6 ) : {pi, • • • ,Pg} = {n,... ,r 6 } } . 


The proof of this fact reduces to the characterization of c-monotonicity with Coulomb 
cost performed in [5, Proposition 2.4]. 

Lemma 4.2. Let 0 < ri < • • • < re. Then we have that 

c,r(ri,r 4 ,r 5 ) + c^(r 2 , rs, re) < min {cvr(ri, r 4 , re) + C 7 r(r 2 , rs, rs), 

C 7 r(n,r’ 3 ,r 6 ) + c^(r 2 ,r 4 ,rs), c^(ri,r 3 ,r 5 ) + c^(r 2 ,r 4 ,re)}. 


Proof. Let us consider the one dimensional Coulomb cost defined in M 


c{vi,V2,V3) 


1 

V 2 - Vi 


+ 


1 

\V3 - V 2 


+ 


1 

|ri - V 3 


yvi,V2,V3 G M. 


We notice that c^(ri,r 4 ,r 5 ) = c(ri,—r 4 ,r 5 ) and, more in general, for all the 3-uples 
appearing in (4.7) the C 7 r-cost and the c-cost satisfy the same relation. In [5, Proposition 
2.4] it is proved that, given the six points —r 4 , —rs, ri, r 2 , rs, re the best way to choose 
two 3-uples to minimize the one dimensional Coulomb cost is to take the points in odd 
position and the points in even position; in particular, we have 


c(-r 4 ,ri,r 5 ) + c(-r 3 ,r 2 ,re) < min {c(-r 4 , ri, re) + c(-r 3 , r 2 , rs), 

c(-r'3,ri,re) + c(-r4, r 2 , rs), c(-r 3 ,ri,r 5 ) + c(-r4, r 2 , re)}. 


which proves (4.7). 


□ 
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Remark 4.3. The previous lemma allows to prove that, for the cost Ctt, the symmetrized 
optimal plan for the problem (2.3) is unique and coincides with the symmetrization of 
the DDI-map. 


5 Proofs of examples and counterexamples 

Proof of Counterexample 2.6. Let to be given by Lemma 3.2 and let us choose e < to/2. 
If, by contradiction, the DDI-map T associated to pe is optimal, by Proposition 2.4 its 
support is c-monotone. Let us consider 1 + e, 1 + 3e and the images of these points 
through T and T oT: 


r(l + e) = l + 7e, ror(l + e) = l + 9e, 

r(l + 3e) = l + 5e, T o r(l + 3e) = 1 + lie. 

We notice that these points 

(ri,...,r@) = (l + e,l + 3e,l + 5e,l + 7e,l + 9e,l + lle), 

are equally spaced; hence, we can apply the scaling properties of the cost function and 
Lemma 3.2 with t = 2e/(l + e) < to to deduce that. 


c(ri,r4,r6)+ c(r2,r3,r5) = 


< 


1 

\c( 

r4 

re A 

1 +c( 

' r2 

rs 

r5 \1 

1 -\- s 


1 + e’ 

1 + e^ 

a + e’ 1 + e’ 

l + e^J 

1 1 

\c( 

r4 

r5 \ 

1 +c( 

' r2 

rs 

re \l 

1 + s 

^ Vi + e’ 

1 + e’ 

1 + e^ 

a + e’ 1 + e’ 

l + e)\ 


= c(ri,r4,r5) + c(r2,r3,r6). 


This contradicts the c-monotonicity of the support by taking p = {3}. □ 

Proof of Counterexample 2. 7. Step 1. By choosing e sufficiently small (independently 
on M), we exclude that the DDI-vnap is optimal in problem (2.2) for every M > 2. 

Let T be the piecewise continuous DDI-map. Consider the following two points in 
the support of the plan associated to T (recall that the support is a closed set): 



T 








lim (r, T{r),T^‘^\r)) = (1 + e, 1 + 3e, 1 + 5e). 
1->■!+£“ 


We claim that they violate the c-monotonicity property (Proposition 2.4) with p = {3}, 
namely 


/ r 96 \ 

/(e) = c ^1 + -, 1 + —, 1 + —j + c(l + e, 1 + 3e, 1 + 5e) 


7e 


cl + -,l + —,l + 5e +cl + e,l + 3e,l + — 


9e 


> 0 
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for e sufficiently small. The proof is similar to that of Lemma 3.2. Using the formulas 
obtained in Lemma 3.1 we just compute the derivatives 


/(O) = /'(O) = /"(O) = 0, 


r(o) 


71v/3 

100 


> 0. 


Step 2. We exclude that the maps DID, IDD, III in the {D, /}^-class are optimal 
in problem (2.2) for M large enough. 

We present the argument to exclude the DID-ma,p, the others being similar. Let us 
fix X, 2 / G (M +1/4, M + 3/4), x < y, and let us consider their orbits through T, that is 
T{x),T{y) G (1,1 + eo) and T(^^(x),G (1 + 3eo, 1 + 4eo)- Let us consider the 
increasingly ordered points 


(ri,...,r6) = {T{y),T{x),T^^\x),T^^\y),x,y)- 


the couples of points (ri,r 4 ,r 6 ) and (r 2 ,r 3 ,r 5 ) belong to the support of the plan 
associated to the DID-m.ap. By Lemma 4.1, we can choose M sufficiently large so that 
the previous points, as well as the points (ri,r 4 ,r 5 ) and (r 2 ,r 3 ,r 6 ), have the same c 
and Ctt cost. By Lemma 4.2, which describes the c,r monotonicity, we have 


c(ri,r4,r5) + c(r2,r3,r6) = c^(ri,r4,r5) + c,r(r2,r'3U6) 

< c^(ri,r4,r6) + c^(r2,r3,r5) 

= c(ri, r4, re) + c(r 2 , rs, rs). 

This shows, by Proposition 2.4, that the DID-map cannot be optimal. □ 

Remark 5.1. Our method can be applied to the 4-marginal problem to show that there 
exists e > 0 such that, setting 

Pe = Y^^[i4+i6e] dr G 

any map in the {D, /}^-class associated to ps is not optimal in problem (2.2). Indeed, 
let T be any such map. Pick two points in [1,1 + 16e] such that the union of their two 
orbits is 


-[ri,..., rg} — j^l + s, 1 + 3s, 1 + 5s, 1 + 7s, 1 + 9s, 1 + 11s, 1 + 13s, 1 + 15sj-. 

We claim that T is not c-monotone because the partitioning of {ri,..., rg} into two 
quartets that minimizes 


c(+i, +2 , +3 , ri J + c(rig, rig, ri^, +3 ) 

is {(ri, rs, re, ry), (r 2 , rg, r 4 , rg)} and such partition doesn’t correspond to any of the 
maps in the {H,/}^-class. 

The way to see this is to extend the results of Section 3 to the 4-marginal case. 
Consider four radii 

(nU2,r3,r4) = (1 + ait, 1 + a2t, 1 + agt, 1 + a4t). 
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Following the same derivation, we find that the angles that give the cost c are 


( vr /2 \ 0 _ ^ - 02 + as + aA 

^ 3 (i) = vr H- 204 - 202 t + o(t). 

\Oi{t) J \ 3 / 37 ry y Ol - 02 - 03 + 04 / 

In turn, this provides the expansion of the cost up to the third order and this information 
can be used to verify the asymptotic optimality of any given partition. We omit the 
formulas, since this computations are better performed with the aid of a computer 
algebra system. 

Proof of Proposition 2.9. Let M be chosen, thanks to Lemma 4.1, so that 

c(ri,r 2 ,r 3 ) = c^(ri,r 2 ,r 3 ) for every n G [1,2], r 2 G [3,4], rs G [M,oo). (5.1) 

In order to prove the c-monotonicity property, since the map T is cyclical and since its 
orbits take exactly one point in each interval [1,2], [3,4], and [M, oo), it is enough to 
show that, given x,y G [1,2], x < y, we have 

c(x, r(x), r(2)(x)) + c{y, T{y),A^\y)) < c{x, yl, B) + c{y, C, D) (5.2) 

for every possible choice of A, B, C, D such that {A, C} = {T(x),T(y)} and {B, D} = 
{T^'^\x),T^'^\y)}. By definition, we have that 

1<x<2/<2<3< T{y) < T{x) < 4 < M < A‘^\x) < A‘^\y)-, 

hence by (5.1) we have that c{x,T{x),T^‘^\x)) = Cn{x,T{x),T^‘^\x)) (and similarly for 
y and for the other 3-uples) and by Lemma 4.2 we have that 

c(x, r(x), r(2)(x)) + c{y, r(y), r(2)(y)) = c^x, T(x), r(2)(x)) + c^y, r(y), T(2)(y)) 

< Cn { x , A, B) + c,r(y, c , D) = c { x . A, B) + c(y, C, D), 

for every possible choice of A, B, C, D such that {A, C} = {T{x),T{y)} and {B, D} = 
{r*^^)(x),T(^)(y)}; this proves (5.2). □ 
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